TITLE

A POSITIVE SELECTION VECTOR FOR THE
BACTERIOPHAGE P1 CLONING SYSTEM

FIELD OF THE INVENTION

This invention allows the selection of large
molecular weight DNA inserts in the P1 cloning system by
positive selection for the ability of clones with
inserts to grow on media in the presence of sucrose.
This new P1 vector also facilitates characterization of
cloned DNA.

BACKGROUND OF THE INVENTION 

The bacteriophage P1 cloning system allows the
headful in vitro packaging of foreign DNA fragments as
large as 95 kb in length. Sternberg, Proc. Natl. Acad.
Sci. USA 87, 103-107 (1990) has shown that the P1
cloning system can generate 100,000 clones containing
inserts per microgram of vector DNA. Large molecular
weight clones are faithfully replicated in E. coli host
strains and DNA from these clones can be easily isolated
by standard molecular biological techniques. Thus, the
P1 cloning system rivals Yeast Artificial Chromosomes
(YAC) and cosmid cloning systems for the generation and
characterization of genomic libraries.

Cosmid cloning vectors were designed by Bruning et
al., Gene 4, 85-107 (1978) and Collins et al., Proc.
Natl. Acad. Sci. USA 75, 4242-4246 (1978), so that a
bacteriophage lambda in vitro packaging reaction can
encapsulate insert DNA up to 47 kb and infect E. coli at
high efficiency. The cosmid vector plus insert DNA is
cyclized in the E. coli bacterium at the lambda cos
sites located on the vector. The same cos site is
recognized by the lambda packaging apparatus for
encapsulation of the vector-insert DNA into the lambda
bacteriophage head. A major limitation of the cosmid
cloning system is the relatively small size of the
insert clone (47 kb). Many eukaryotic genes have been
shown to be larger than 50 kb, with some genes (e.g.,
dystrophin) up to 1000 kb. The small size of cosmid
clones necessitates labor-intensive and error-prone
multiple chromosome "walking" and "jumping" procedures
when isolating large genomic clones.

Another system for cloning large molecular weight
DNA fragments is Yeast Artificial Chromosomes (YACs),
developed by Burke et al., Science 236, 806-812 (1987).
YAC cloning enables DNA inserts up to 1000 kb to be
propagated as minichromosomes in specific yeast strains.
YAC vectors contain a yeast replication origin, a
centromere, and a set of telomeres. After ligation of
insert DNA to the YAC vector, the DNA is introduced into
yeast spheroplasts by direct DNA transformation. The
major limitations of YAC cloning are the inefficiency of
the transformation reaction (about 1000 clones per
microgram of vector DNA) and the difficulty in
characterizing YAC clones once they have been
generated. The YAC clone represents a small proportion
(less than 1%) of the total DNA in a yeast cell. This
makes recovery, isolation, and analysis of any
particular YAC clone burdensome.
The bacteriophage P1 cloning system complements
both cosmid and YAC cloning in the construction of
genomic libraries. A 50,000-member human DNA library
has been generated in the P1 cloning system by Sternberg
et al., The New Biol. 2, 151-162 (1990), which represents
about one-fold coverage of the human genome. The most
recent P1 cloning vector (pNS582tet14Ad10) consists of a
P1 pac site used for the initiation of headful
packaging, two P1 lox sites which cyclize the P1 vector
upon introduction into an E. coli host strain containing
the P1 Cre protein, a kanamycin resistance gene for
determining which E. coli cells contain a P1 plasmid, and
a tetracycline resistance gene for the cloning of insert
DNA. The P1 cloning vector also contains a bacteriophage
P1 plasmid replicon which maintains the P1 clone at a
single copy per cell, and an IPTG inducible P1 lytic
replicon for amplifying P1 clones in DNA isolation
procedures. Another aspect of the cloning vector is a
10 kb "stuffer fragment" from adenovirus DNA which gives
flexibility in the headful packaging reaction.
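
The statement that a 50,000-member library gives about one-fold coverage can be checked with simple arithmetic. The sketch below is illustrative only: the haploid genome size (3 x 10^9 bp) and the mean insert size (60 kb) are assumptions chosen for the example and are not stated in this document. It also applies the standard Clarke-Carbon relation, P = 1 - (1 - f)^N, to estimate the chance that a given sequence is present in a library of N clones.

```python
import math

# Assumed value for illustration; not taken from this document.
HUMAN_GENOME_BP = 3.0e9


def fold_coverage(n_clones, mean_insert_bp, genome_bp=HUMAN_GENOME_BP):
    """Total cloned DNA divided by genome size."""
    return n_clones * mean_insert_bp / genome_bp


def probability_represented(n_clones, mean_insert_bp, genome_bp=HUMAN_GENOME_BP):
    """Clarke-Carbon estimate: P = 1 - (1 - f)^N, with f = insert/genome."""
    f = mean_insert_bp / genome_bp
    return 1.0 - (1.0 - f) ** n_clones


def clones_needed(p, mean_insert_bp, genome_bp=HUMAN_GENOME_BP):
    """Number of clones needed to reach probability p of representation."""
    f = mean_insert_bp / genome_bp
    return math.ceil(math.log(1.0 - p) / math.log(1.0 - f))


if __name__ == "__main__":
    # 60 kb mean insert size is a hypothetical figure.
    print(fold_coverage(50_000, 60_000))
    print(probability_represented(50_000, 60_000))
    print(clones_needed(0.99, 60_000))
```

Under these assumed numbers, one-fold coverage still leaves roughly a one-in-three chance that any particular sequence is absent, which is why libraries of several genome equivalents are usually prepared.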

A model P1 cloning reaction consists of cutting the
pNS582tet14Ad10 vector with the restriction enzymes ScaI
and BamHI to generate 5 kb and 25 kb vector "arms". The
digested vector DNA is then treated with calf intestine
alkaline phosphatase to inhibit self ligation of the
vector. The vector arms are added to genomic DNA
fragments that were previously digested with a BamHI-end
compatible restriction enzyme (e.g., Sau3A). The two
DNAs are then ligated and a portion of the ligation
mixture is added to the first part of the two stage P1
in vitro packaging reaction. The first reaction
consists of a cell extract prepared from P1 infected
E. coli which is enriched for the P1 pac cleavage proteins.
After pac cleavage, the DNA mixture is incubated in the
stage II P1 in vitro packaging reaction, which consists
of an E. coli cell extract enriched for P1 virion capsids
and tails. The phage encapsulated DNA is then used to
infect an E. coli host strain that contains the Cre
recombinase. A lox-lox site specific recombination
reaction effectively cyclizes the P1 vector-insert clone,
which is maintained as a single copy extrachromosomal
circular plasmid. To isolate DNA from a P1 clone, the
cell containing the clone is grown in the presence of
IPTG, which induces the P1 lytic replicon. This
induction increases the copy number of the P1 clone about
25 fold, which gives enough DNA (about 1 microgram) from
a 10 ml mini-alkaline lysis DNA isolation procedure for
standard restriction mapping and size characterization
procedures.

One problem encountered in the P1 cloning system is
that a significant number of P1 vector molecules that
contained no insert were present after a typical cloning
experiment. These "no-insert" clones interfered with
subsequent analysis of the cloning experiment in two
ways. First, the number of clones to be screened when
looking for a particular DNA insert was markedly
increased due to the presence of "no-insert" containing
clones. Secondly, upon subsequent growth of E. coli
from a P1 cloning experiment, the bacteria that
contained a "no-insert" vector generally grew much
better than clones that contained large DNA inserts.
Therefore, after a few rounds of growth the population
of E. coli containing clones was greatly enriched for
"no-insert" vector clones.

To overcome the problems encountered in the
previous versions of the P1 cloning system, the
pAd10-SacBII positive selection P1 cloning vector was
developed. Many other positive selection based cloning
systems have been developed for standard plasmid based
recombinant DNA work. Henrich et al., Gene 42, 345-349
(1986) demonstrated a positive selection vector based on
the E gene (lysis protein) of bacteriophage φX174. Kuhn
et al., Gene 42, 253-263 (1986) developed a system
which uses the EcoRI endonuclease. Burns et al., Gene
27, 323-325 (1984) showed that positive selection can be
generated in a system based on resistance to
5-fluorouracil. Other similar systems are listed in the
Burns et al. article.

Another positive selection system used in DNA
cloning is based on the sacB gene from Bacillus
subtilis. This gene codes for the enzyme levansucrase
(sucrose:2,6-β-D-fructan 6-β-D-fructosyltransferase; EC
2.4.1.10), which catalyzes the transfructosylation of
sucrose to various acceptor substrates, resulting in the
hydrolysis of sucrose and levan synthesis. Gay et al.,
J. Bacteriol. 164, 918-921 (1985) demonstrated that the
production of levansucrase in E. coli is lethal in the
presence of growth media containing 5% sucrose. Gay et
al. have used this knowledge to develop a positive
selection cloning system based on inactivating the
B. subtilis sacB structural gene. This allows the growth
of only those E. coli bacteria containing recombinant
clones that have DNA inserts when grown in the presence
of sucrose.

Tang et al., Gene (in press) (1990), (U.S. Patent
Application Serial No. 07/376,474) have cloned the sacB
gene from Bacillus amyloliquefaciens and shown extensive
DNA sequence homology to the sacB gene from B. subtilis.
When the sacB gene from B. amyloliquefaciens was cloned
on a multicopy plasmid in E. coli, a lethal phenotype was
observed when cells were grown in the presence of
sucrose. This knowledge has inspired us to develop a
novel P1 positive selection cloning vector,
pAd10-SacBII.

SUMMARY OF THE INVENTION

This invention describes a new bacteriophage P1
cloning vector (pAd10-SacBII) which allows for the
positive selection of clones containing large molecular
weight inserts. Improvements of the P1 cloning system
are:

     (a) a positive selection system based on the
         sacB gene from Bacillus amyloliquefaciens,
         which codes for the enzyme levansucrase
         (sucrose:2,6-β-D-fructan
         6-β-D-fructosyltransferase; EC 2.4.1.10). A
         unique promoter cassette has been invented
         which allows the disruption of sacB expression
         by cloning foreign DNA into a unique BamHI
         site between the promoter and the sacB
         structural gene.

     (b) a regulatory system based on the
         bacteriophage P1 c1 repressor protein to
         control the expression of the lethal sacB
         gene.

     (c) an E. coli host strain that contains an
         integrated lambda phage that expresses the
         P1 c1 gene.

     (d) placement of bacteriophage T7 and Sp6
         RNA polymerase promoters which border the
         cloning site of the P1 vector to
         facilitate characterization and analysis
         of P1 clones.

     (e) placement of unique rare cutting
         restriction enzyme sites, SfiI, SalI, and
         NotI, which border the BamHI cloning site
         of the P1 vector to facilitate
         characterization and isolation of cloned
         foreign DNA inserts.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 illustrates the steps involved in
constructing the P1 positive selection vector.

Figure 2 illustrates the positive selection P1
cloning vector pAd10-SacBII.

Figure 3 illustrates a BglII/XhoI restriction
digest of pAd10-SacBII-human clones from (sucrose) and
(no sucrose) agar plates.

Figure 4 illustrates the ability to linearize and
size characterize a P1 clone using the rare restriction
site NotI.

STATEMENT OF DEPOSIT

The following plasmids and bacteria relating to
this invention have been deposited in the American Type
Culture Collection, 12301 Parklawn Drive, Rockville,
Maryland 20852-1776 under the Budapest Treaty.

pAd10-SacBII was designated ATCC Accession No.
68505.

pAd10-SacBI was designated ATCC Accession No.
68504.

NS3607 was designated ATCC Accession No. 55135.

DETAILED DESCRIPTION OF THE INVENTION

The vector described herein is an improvement to
the P1 cloning system in the following manner:

     (1) The pAd10-SacBII vector allows for the
positive selection of large molecular weight DNA clones
by killing clones that do not have a DNA insert when
bacteria containing the clone are grown on media
supplemented with sucrose.

     (2) Analysis of large molecular weight clones
is facilitated by the ability to separate the vector DNA
fragment from the clone insert by cutting with the
restriction enzymes NotI and SfiI or SalI, which are
unique to the vector and directly border the unique
BamHI site used in cloning.

     (3) Chromosomal "walking" and "jumping"
procedures will be greatly improved by the ability to
make RNA and DNA probes from both ends of the DNA insert
by using the bacteriophage T7 and Sp6 promoters that
border the unique BamHI site.

     (4) DNA sequence information from both ends
of the cloned DNA insert will be readily obtained by
using the unique bacteriophage T7 and Sp6 promoters as
sites for DNA sequencing primers. This sequence
information can then be reported as Sequence Tagged
Sites (STS) for each P1 clone.






WO 92/14819 PCT/US92/01074 


Construction of the pAd10-SacBII Vector
Construction of the P1 positive selection cloning
vector (pAd10-SacBII) was initiated by cutting the
parent P1 vector pNS582tet14Ad10 (Figure 1), The New
Biol. 2, 151-162 (1990), available from New England
Nuclear as NENFHAGE®, with the restriction enzymes SalI
and BamHI. This removed a 276 base pair fragment from
the tetracycline gene of the parent vector. A synthetic
duplex DNA oligonucleotide (promoter cassette), Lewin, B.
(1983) Genes, John Wiley and Sons, NY, was then inserted
into the SalI-BamHI site in a two step process. First,
a 52 base pair duplex oligonucleotide (the upper
strand is referred to as SEQ ID NO. 1; the lower strand
is referred to as SEQ ID NO. 2)

5' TCGAGCTTGA CATTGTAGGA CTATATTGCT CTAATAAATT TGCGGCCGCT TG 3'
3'     CG AACTGTAACA TCCTGATATA ACGAGATTAT TTAAACGCCG GCGAACCTAG 5'


that contained a consensus E. coli promoter sequence, a
P1 c1 repressor sequence that overlapped the E. coli
synthetic promoter, and a unique NotI site, was inserted
into the SalI and BamHI site of the parent vector,
regenerating the BamHI site but destroying the SalI
site. Next, a 37 base pair duplex synthetic
oligonucleotide (the upper strand is referred to as
SEQ ID NO. 3; the lower strand is referred to as SEQ ID
NO. 4)

5' GGCCGCGGAT CCGTCGACGG CCAATTAGGC CTACGTA 3'
3'     CGCCTAG GCAGCTGCCG GTTAATCCGG ATGCATCTAG 5'

that contained a series of restriction sites (NotI,
BamHI, SalI, SfiI, SnaBI), was inserted into the first
52 base pair oligonucleotide using a NotI and BamHI
digest. The NotI site was regenerated while the BamHI
site was destroyed, leaving only a unique BamHI site
located between the NotI and SalI sites. This series of
genetic engineering steps created a P1 plasmid construct
named pAd10-37 as shown in Figure 1.
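
The two synthetic duplexes above can be checked for internal consistency. The following sketch assumes that each lower strand is printed 3' to 5' and is offset by four bases relative to the upper strand, so that each duplex carries a 4-base 5' overhang at both ends; the printed alignment does not survive in this text, so that layout is an assumption. The function names are illustrative only.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Strands as printed in the text; lower strands are assumed to read 3'->5'.
OLIGO1_UPPER = "TCGAGCTTGA CATTGTAGGA CTATATTGCT CTAATAAATT TGCGGCCGCT TG"
OLIGO1_LOWER = "CG AACTGTAACA TCCTGATATA ACGAGATTAT TTAAACGCCG GCGAACCTAG"
OLIGO2_UPPER = "GGCCGCGGAT CCGTCGACGG CCAATTAGGC CTACGTA"
OLIGO2_LOWER = "CGCCTAG GCAGCTGCCG GTTAATCCGG ATGCATCTAG"


def clean(seq):
    """Remove the spacing used in the printed sequence."""
    return seq.replace(" ", "").upper()


def check_duplex(upper_5to3, lower_3to5, overhang=4):
    """Verify base pairing and return both 5' overhangs, read 5'->3'.

    Raises ValueError if the paired region is not complementary.
    """
    upper = clean(upper_5to3)
    lower = clean(lower_3to5)
    paired_upper = upper[overhang:]
    paired_lower = lower[:len(lower) - overhang]
    if paired_upper.translate(COMPLEMENT) != paired_lower:
        raise ValueError("strands are not complementary in the paired region")
    # The lower strand is written 3'->5', so its unpaired tail is reversed
    # to read 5'->3'.
    return upper[:overhang], lower[-overhang:][::-1]


if __name__ == "__main__":
    print(check_duplex(OLIGO1_UPPER, OLIGO1_LOWER))   # SalI-type and BamHI-type ends
    print(check_duplex(OLIGO2_UPPER, OLIGO2_LOWER))   # NotI-type and BamHI-type ends
    print("GCGGCCGC" in clean(OLIGO1_UPPER))          # NotI site in the 52-mer
```

Under that assumed layout, the 52-mer pairs over 48 bases with TCGA and GATC overhangs, and the 37-mer pairs over 33 bases with GGCC and GATC overhangs, which matches the SalI/BamHI and NotI/BamHI insertions described in the text.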
The next step in the construction of the P1
positive selection vector was to transfer a 5.0 kilobase
(kb) ClaI DNA fragment, which includes the pAd10-37
tetracycline gene with the promoter cassette insert, P1
lytic replicon, and part of the kanamycin gene, into the
ClaI site of pBR322. This was done to facilitate
further experimental manipulations with a smaller
plasmid (9 kb) rather than the 30 kb pAd10-37 plasmid.
This plasmid was named pBR5.0. The pAd10-37 plasmid and
the plasmid pBR322 were cut with the ClaI restriction
enzyme. The 5.0 kilobase fragment was isolated and
ligated into the ClaI site of pBR322.

A pUC plasmid (pBE501) (Figure 1) containing the
sacB gene from Bacillus amyloliquefaciens (gift from
Vansantha Nagarajan, E. I. du Pont de Nemours and
Company, CR&D Microbiology, U.S. Patent Application
Serial No. 07/376,474) was cut with restriction enzyme
EcoRI, which generated a 1.6 kb DNA fragment that
contains the structural gene for sacB and the ribosome
binding site but does not contain the endogenous sacB
promoter. A 9 base pair EcoRI/SnaBI adapter (the
upper strand is referred to as SEQ ID NO. 5; the lower
strand is referred to as SEQ ID NO. 6)



that contains an internal unique SpeI restriction site
was placed on each end of the 1.6 kb EcoRI DNA fragment.
This was done so that the sacB structural gene was
bordered by unique SpeI sites for easy physical
identification and characterization of clones containing
this insert. The modified 1.6 kb sacB fragment was
ligated, via blunt ends, into the unique SnaBI site of
pBR5.0. DNA from this ligation reaction was transformed
into E. coli strain DH5αIq (available from Bethesda
Research Labs) that already contained a plasmid,
pACYC-P1c1, New England Biolabs. The pACYC-P1c1 plasmid
contains the gene for the P1 c1 repressor protein, which
is needed to prevent expression of the sacB gene, which
even under permissive growth conditions (no sucrose in
the media) exhibits a lethal phenotype. The P1 c1
protein is expressed by bacteriophage P1 to repress
phage lytic functions during vegetative growth. This
protein acts by binding an asymmetric recognition DNA
sequence, usually in the vicinity of an RNA polymerase
promoter, Eliason and Sternberg, J. Mol. Biol. 198,
281-293 (1987). The lethal phenotype is probably due to
the strong consensus E. coli promoter driving
overexpression of the sacB gene to produce a periplasmic
SacB protein. The P1 c1 repressor protein is able to
block expression of the sacB gene and allow replication
of the otherwise lethal plasmid. Positive clones were
identified by hybridization against the nick translated,
radioactively labeled 1.6 kilobase fragment from plasmid
pBE501. The correct plasmid construct was confirmed by
physical mapping with diagnostic restriction enzymes and
by the sensitivity of E. coli DH5αIq cells containing
the plasmid to growth media containing sucrose, even in
the presence of the P1 c1 repressor protein. This
plasmid construct was named pBR5.0-17.

The next step in the construction of the P1
positive selection vector was to return the 6.6 kb ClaI
fragment (previously 5.0 kb) of plasmid pBR5.0-17 to the
parent pAd10-37 vector. DNA from plasmid pBR5.0-17 was
subjected to a partial ClaI restriction digest, since
there is a ClaI site in the sacB structural gene, and
the 6.6 kb fragment was isolated and ligated to the 25
kb ClaI fragment of pAd10-37. DNA from this ligation
reaction was transformed into E. coli strain DH5αIq that
contained a lambda prophage expressing the P1 c1 gene.
Positive clones were identified by restoration of the
kanamycin resistant phenotype and by physical mapping
via restriction enzyme digests. This P1 vector was
named pAd10-SacBI (Figure 1). Initial characterization
of bacterial cells containing this vector showed a
lethal phenotype when grown on media containing sucrose.
When DNA inserts were cloned into the unique BamHI site
of pAd10-SacBI, cells containing these constructs
acquired the ability to grow on media containing
sucrose. This result demonstrated the utility and
success of the P1 SacB positive selection vector.

To further increase the utility of the P1 SacB
positive selection vector, a modification of the unique
restriction site region (NotI, BamHI, SalI, and SfiI)
was performed. The pAd10-SacBI vector was cut with the
restriction enzymes NotI and SalI, and a 56 base pair
duplex synthetic oligonucleotide containing the promoter
sequences for the T7 RNA polymerase and Sp6 RNA
polymerase with a new BamHI site between the promoters
was inserted into the NotI/SalI site at a position which
directly bordered the unique BamHI cloning site, thus
deleting the original unique BamHI site. (The upper
strand is referred to as SEQ ID NO. 7; the lower strand
is referred to as SEQ ID NO. 8.)

5' GGCCGCTAAT ACGACTCACT ATAGGGAGAG GATCCTTCTA TAGTGTCACC TAAATG 3'
3'     CGATTA TGCTGAGTGA TATCCCTCTC CTAGGAAGAT ATCACAGTGG ATTTACAGCT 5'

The resulting fragment is referred to as SacBII (Figure
1). The plasmid containing SacBII is referred to as
pAd10-SacBII (Figure 2). Confirmation of the correct
plasmid construct was obtained by restriction mapping
and by testing for the functional presence of the T7 and
Sp6 promoters. We also observed that cells containing
the pAd10-SacBII vector exhibited a "less" lethal
phenotype when grown without the P1 c1 repressor in the
absence of sucrose. This may be due to an
"attenuator-like" effect produced by the insertion of the
T7 and Sp6 promoter DNA sequences between the sacB
structural gene and its synthetic promoter. This is the
final version of the P1 positive selection vector and is
named pAd10-SacBII. Experiments describing the utility
of this vector are described below.
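
The arrangement of the two promoters around the new BamHI site can be illustrated by locating them in the 56 base pair oligonucleotide. In the sketch below, the T7 and Sp6 promoter strings are the commonly cited consensus sequences and are an assumption of this example, not something this document states; the helper names are likewise illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Upper strand of the 56 bp SacBII oligonucleotide, as printed in the text.
SACBII_UPPER = (
    "GGCCGCTAAT ACGACTCACT ATAGGGAGAG GATCCTTCTA TAGTGTCACC TAAATG"
).replace(" ", "")

# Commonly cited consensus promoter sequences (assumed for this example).
T7_PROMOTER = "TAATACGACTCACTATAGGG"
SP6_PROMOTER = "ATTTAGGTGACACTATAG"
BAMHI_SITE = "GGATCC"


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def locate(motif, upper=SACBII_UPPER):
    """Return (strand, 0-based start on the upper strand), or None.

    'upper' means the motif reads 5'->3' on the upper strand; 'lower'
    means it reads 5'->3' on the lower strand.
    """
    start = upper.find(motif)
    if start >= 0:
        return ("upper", start)
    start = upper.find(reverse_complement(motif))
    if start >= 0:
        return ("lower", start)
    return None


if __name__ == "__main__":
    for name, motif in [("T7", T7_PROMOTER), ("BamHI", BAMHI_SITE),
                        ("Sp6", SP6_PROMOTER)]:
        print(name, locate(motif))
```

With these assumed consensus sequences, the T7 promoter lies on the upper strand upstream of the BamHI site and the Sp6 promoter lies on the lower strand on the other side, so both promoters point toward the cloning site and can transcribe into either end of an insert.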

Construction of E. coli host strain
expressing the P1 c1 repressor

To replicate the pAd10-SacBII vector, the expression
of levansucrase must be controlled. sacB gene
expression under the control of the synthetic E. coli
promoter gives a lethal phenotype in the absence of
sucrose. We therefore constructed a host strain which
expresses the P1 c1 gene at a high enough level to
inhibit levansucrase expression. This was accomplished
by constructing a bacteriophage lambda prophage that
contains the P1 c1 repressor gene and inserting the
construct into E. coli DH5αIq.

Phage lambda (imm21-P1:7Δ5b) is a phage containing
a functioning P1 c1 repressor gene. It was generated
from the starting phage lambda (fdam lS fc575 £529 attλ+
imm21) in which P1 EcoRI fragment 7 (O'Brien, Genetic
Maps, Cold Spring Harbor Press, Cold Spring Harbor, NY
1990) was cloned at lambda map coordinate 65.8. EcoRI-7
contains the P1 cre gene, the P1 loxP site, and the P1
c1 repressor gene, Sternberg et al., J. Mol. Biol. 187,
197-212 (1986). Deletion mutants of the above starting
phage were isolated and the location of the deleted DNA
determined by restriction mapping, as described in
Eliason and Sternberg, J. Mol. Biol. 198, 281-293
(1987). One mutant, designated lambda (imm21-P1:7Δ5b),
contained a contiguous deletion of DNA extending from
the attλ site into the cloned P1 sequences. P1
BamHI-EcoRI fragment 8 (the cre-containing EcoRI-BamHI
fragment from P1 EcoRI fragment 7) was completely removed
and part of P1 BamHI fragment 9 was removed. In vivo
loxP-cre reactions indicated that this lambda deletion
mutant construct was loxP-, cre-. Further experiments
showed that an E. coli lysogen containing this prophage
synthesized functional P1 c1 repressor protein. This
strain is called NS3607. These results indicate that
the Δ5b deletion terminates between P1 loxP and the P1
c1 gene in the P1 BamHI fragment 9 (O'Brien, supra).
The E. coli DH5αIq (P1 c1) strain was used as a host
strain to prepare pAd10-SacBII plasmid DNA for the
cloning experiments described below.






Preparation of pAd10-SacBII DNA
Significant care must be given in the preparation
of pAd10-SacBII DNA. Vector DNA is prepared by the
cesium chloride density gradient method as described by
Godson and Vapnek, Biochim. Biophys. Acta 299, 516-522
(1973). The observation that bacteria containing the
SacBI vector expressed a lethal phenotype even in the
absence of sucrose in the media resulted in a careful
analysis of cesium chloride prepared SacBI DNA.
Restriction analysis showed that a significant portion
of the vector DNA was deleted in the region containing
the sacB gene. When this DNA was transformed into
E. coli DH5αIq and the bacteria grown on agar plates
containing 5% sucrose, only bacteria that contained the
population of SacBI plasmids that had a deletion grew
well. This result significantly compromised the utility
of the P1 positive selection system since it generated a
large background of clones not containing DNA inserts,
exactly what the SacB positive selection was supposed to
eliminate. This problem was overcome by the
construction of the pAd10-SacBII vector, which contains
an extra 43 base pairs of DNA (T7 and Sp6 RNA polymerase
promoters) between the sacB structural gene and its
synthetic promoter. E. coli DH5αIq (P1 c1) NS3607 that
contained this vector (SacBII) did not display the lethal
phenotype in the absence of sucrose in the growth media.
Still, careful analysis of vector DNA is necessary when
preparing large amounts of DNA for cloning experiments.
This is achieved by restriction analysis of vector DNA
and by transformation of competent E. coli cells with
vector DNA to determine the percentage of the plasmid
population which is sucrose resistant.
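
The quality check described above reduces to a ratio of colony counts. The sketch below is a minimal illustration, assuming that equal volumes of the same transformation are plated on kanamycin and on kanamycin plus 5% sucrose; the function name and the colony counts are hypothetical and are not data from this document.

```python
def sucrose_resistant_percent(kan_colonies, kan_sucrose_colonies):
    """Percentage of transformants that survive on sucrose.

    Assumes equal plating volumes on both plate types, so the counts
    can be compared directly.
    """
    if kan_colonies <= 0:
        raise ValueError("need at least one colony on the kanamycin plate")
    return 100.0 * kan_sucrose_colonies / kan_colonies


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    print(sucrose_resistant_percent(2000, 10))
```

A low percentage indicates that few vector molecules in the preparation have lost sacB function, which is the condition required for a low background of "no-insert" clones.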

Restriction digest of pAd10-SacBII DNA
Vector arms are prepared by a sequential double
restriction digest of the SacBII DNA. Three micrograms
of vector DNA was incubated with the restriction enzyme
ScaI (New England Biolabs) as per the manufacturer's
instructions, in a 30 microliter reaction. The reaction
was incubated at 37°C for one hour. The reaction volume
was increased to 50 microliters with water, BamHI
restriction buffer and 1 microliter of BamHI. The
reaction was incubated for 10 minutes at 37°C. The
digest reaction was then extracted with one volume of
phenol, then one volume of chloroform-isoamyl alcohol
(24:1), saving the aqueous layer each time. The DNA was
precipitated with two to three volumes of ice cold
ethanol and 0.3 M sodium acetate and the mixture placed
on ice for 30 minutes. The DNA was collected by
centrifugation in a microcentrifuge at room temperature
for 20 minutes, washed with 0.5 ml of 75% ethanol and
the pellet air dried. The DNA pellet was resuspended in
20 microliters of TE buffer (10 mM Tris (pH 8.0), 1 mM
EDTA).



Preparation and analysis of high molecular
weight insert DNA for cloning into pAd10-SacBII
There are many protocols for the preparation of
genomic DNA that can be used in the P1 cloning system.
The choice depends upon the organism, tissue or cell type
in question. The pAd10-SacBII vector was tested on DNA
prepared from human lymphoblastoid cell line 697 as
described by Sternberg, U.S. Patent Application No.
07/397,071, (1989) and Sternberg et al., The New Biol.
2, 151-162 (1990). Briefly, DNA was isolated from lysed
cells and fractionated on a 10-40% sucrose gradient.
After dialysis, 0.5 ml of the large molecular weight DNA
was incubated with 4 units of Sau3A restriction enzyme
and restriction buffer minus magnesium, overnight at
4°C. This ensured adequate mixing of the Sau3A
restriction enzyme with the viscous genomic DNA.
Digestion was initiated by adding magnesium chloride to
10 mM. Aliquots were removed at 10 different time
points and the reaction terminated by adding EDTA to 20
mM and heating at 70°C for 15 minutes.
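
Bringing a sample "to 10 mM" or "to 20 mM" requires accounting for the volume of stock solution that is added. The sketch below shows that calculation under stated assumptions: the stock concentrations (1 M magnesium chloride, 0.5 M EDTA) and the 50 microliter aliquot volume are hypothetical values chosen for the example and are not specified in this document.

```python
def stock_volume_to_add(sample_volume_ul, final_mM, stock_mM):
    """Volume of stock (in microliters) that brings a sample to final_mM.

    Solves stock_mM * v = final_mM * (sample_volume_ul + v) for v, which
    accounts for the dilution caused by the added volume. Assumes the
    sample initially contains none of the solute.
    """
    if stock_mM <= final_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_mM * sample_volume_ul / (stock_mM - final_mM)


if __name__ == "__main__":
    # 0.5 ml digest brought to 10 mM MgCl2 from an assumed 1 M stock.
    print(round(stock_volume_to_add(500, 10, 1000), 2))
    # Assumed 50 microliter aliquot brought to 20 mM EDTA from 0.5 M stock.
    print(round(stock_volume_to_add(50, 20, 500), 2))
```

The same relation applies to any of the "added to a final concentration" steps in these procedures, provided the stock concentration actually in use is substituted for the assumed one.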

The Sau3A partial digest of genomic DNA was
analyzed by removing 10% of each time point fraction and
subjecting the DNA to agarose field inversion pulse gel
electrophoresis. The DNA was fractionated via
electrophoresis to resolve fragment sizes between 20 and
200 kb on a 1% agarose gel in 0.5X TBE buffer (89 mM
Tris pH 8.0, 89 mM borate, 2 mM ethylene diamine
tetraacetic acid (EDTA)), for 4 hours at 180 volts with
pulse conditions of 0.6 sec. forward, 0.2 sec. reverse,
and a ramp of 20. The gel was stained with ethidium
bromide and DNA visualized with UV fluorescence. Those
aliquots that contained digested genomic DNA fragments
in the 70 to 100 kb range were used as substrate for
cloning into the pAd10-SacBII vector.

Ligation and packaging of SacBII vector
and genomic insert DNA

The ligation of the SacBII vector arms to genomic
DNA and its subsequent in vitro packaging into P1
capsids has been described in detail in Sternberg, Proc.
Natl. Acad. Sci. USA 87, 103-107 (1990) and Sternberg,
U.S. Patent Application No. 07/397,071. 0.2 micrograms
of ScaI/BamHI digested SacBII vector arms were incubated
with Sau3A digested genomic DNA or with TE buffer (no
insert control experiment) in ligase buffer (50 mM Tris
pH 7.5, 10 mM MgCl2, 10 mM dithiothreitol, 50 mg/ml bovine
serum albumin) for 30 minutes at 37°C. DNA ligase and ATP
(to 2 mM) were then added and the reaction incubated at
16°C overnight. The ligation reaction was then heated at
65°C for 5 minutes and added to the first part of the
two stage in vitro P1 packaging reaction. The stage 1
extract is a bacteriophage P1 lysate that contains the
enzymes needed to cleave the pac site of the P1 cloning
vector. The stage 2 extract is a bacteriophage P1
lysate that contains the components (phage capsids and
tails) necessary to form an infectious phage particle.
The stage 1 reaction was incubated for 15 minutes at
30°C and then transferred to the stage 2 reaction, which
was incubated for 20 minutes at 30°C. The final
reaction was diluted to a total volume of 180
microliters with TMG buffer (10 mM Tris pH 8.0, 10 mM
MgCl2, 0.1% gelatin) plus pancreatic DNase (10 µg/ml).
Fifteen microliters of chloroform were added to the P1
packaging reaction, which was then stored at 4°C.

Growth and DNA preparation of P1 clones
containing high molecular weight DNA inserts
A 10 to 20 microliter aliquot from each P1
packaging reaction was added to 0.1 ml of mid-log phase,
concentrated (10X) E. coli bacteria containing the Cre
recombinase (strains NS3145 or NS3529). After a 10
minute incubation at 37°C, one ml of L broth was added
to the phage/cell mixture, which was then incubated for
30-60 minutes at 37°C with shaking. The infected cells
were pelleted in a microcentrifuge, resuspended in
0.15 ml of L broth and spread on L agar plates that
contained 25 µg/ml kanamycin with or without 5% sucrose.
The plates were incubated at 37°C overnight. Kanamycin
and kanamycin/sucrose resistant colonies were recorded.
P1 clone plasmid DNA was prepared by picking a
single bacterial colony with a 50 microliter capillary
pipet and adding it to 10 ml of L broth containing
25 µg/ml kanamycin. The cells were grown at 37°C for
about 3 hours (early log phase) and IPTG was added to a
final concentration of 1 mM. The cells were grown for
another 5 hours and then pelleted. DNA was prepared by
the alkaline lysis procedure of Birnboim et al., Nucl.
Acids Res. 7, 1513-1523 (1979). Plasmid DNA was
resuspended in 40 microliters of TE buffer and 0.4
micrograms RNase.

Characterization of genomic insert DNA
from pAd10-SacBII P1 clones
The pAd10-SacBII vector allows the characterization
of insert DNA by use of the novel rare cutting
restriction sites which border the BamHI cloning site
and by the ability to make RNA probes from either or
both ends of the genomic insert via the T7 and Sp6
promoters. Physical mapping and size characterization
of cloned insert DNA is facilitated by the ability to
separate the SacBII vector sequence from the genomic
DNA. Chromosome walking and DNA sequencing of cloned
inserts is achieved by the utilization of the unique
promoter sequences. Those skilled in the art will
recognize that these features have been successfully
utilized in plasmid and cosmid cloning as described in
Melton et al., Nucl. Acids Res. 12, 7035-7056 (1984) and
Wahl et al., Proc. Natl. Acad. Sci. USA 84, 2160-2164
(1987).

Physical characterization of P1 clones was carried
out by restriction enzyme digestion. Each digestion
reaction contained 13 microliters of plasmid DNA from a
particular P1 clone, 1.5 microliters of restriction
buffer, and 1.0 microliter of each restriction enzyme
(either BglII, NotI or NotI and SalI). The restriction
digest was performed at 37°C for 2 hours, then subjected
to a phenol/chloroform extraction, and air dried to
remove excess chloroform. If the volume was too large,
an ethanol precipitation step was also included and the
DNA resuspended in 15 microliters of TE buffer. The DNA
was then analyzed by 1% agarose pulse field inversion
gel electrophoresis in 0.5X TBE buffer using pulse
conditions of 0.6 sec. forward, 0.2 sec. reverse, ramp
20 at 120 volts for 15 hours. The gel was then stained
with ethidium bromide and the DNA visualized by UV
fluorescence.

EXAMPLE 1 

Transformation efficiencies of uncut
and cut/ligated pAd10-SacBII
To determine the efficiency of the positive
selection feature of the pAd10-SacBII vector, DNA
transformation experiments were carried out. After a
cesium chloride banded DNA preparation of the SacBII
vector had been generated, the DNA was analyzed by
restriction digests with BglII, SpeI, and BamHI/ScaI.
The results showed that the correct size DNA fragments
were produced with no apparent rearrangements observed.
Two hundred nanograms of pAd10-SacBII from two different
DNA preparations was transformed into E. coli strains
NS3145 (cre+) and DH5Iq (cre-) as described by Maniatis
et al., Molecular Cloning: A Laboratory Manual (Cold
Spring Harbor Laboratory, Box 100, Cold Spring Harbor,
NY (1982)). Strain NS3145 maintains the SacBII vector as
a single copy plasmid due to cre mediated lox-lox
recombination. Strain DH5Iq maintains the SacBII vector
as a multicopy plasmid. This difference is important
since the sacB positive selection function must work
with only one copy of the gene present per bacterial
cell. The transformed cells were plated on L agar
plates containing kanamycin or kanamycin/5% sucrose and
grown overnight at 37°C. The number of resistant
colonies was counted (or estimated) and is shown in
part A of Table 1. The same pAd10-SacBII DNA (100 ng
each) was cut with the restriction enzymes BamHI, SalI,
ScaI/BamHI or ScaI/SalI, extracted with
phenol/chloroform, ethanol precipitated and resuspended
in TE buffer (as described previously). The digested
DNA was then placed in a ligation reaction, incubated
overnight at 16°C, and transformed into E. coli strains
NS3145 and DH5Iq (as described above). Kanamycin and
kanamycin/sucrose resistant colonies were recorded and
are shown in part B of Table 1.

TABLE 1

                    NS3145 (cre+)           DH5Iq (cre-)
DNA               Kan-r    Sucrose-r      Kan-r    Sucrose-r

A)
uncut #1         ~10,000       6         ~10,000       2
uncut #2         ~10,000       7         ~10,000       7

B)
uncut                361       2             405       0
BamHI                135       6              90       5
SalI                 246       5             222       3
ScaI/BamHI            56       7               9      13
ScaI/SalI            107       7              34       9


The results in part (A) show that the positive
selection efficiency, calculated as the number of kan-r
colonies divided by the number of kan/sucrose-r
colonies, for the uncut SacBII vector was approximately
1000 fold. This was true whether the sacB gene was at
single copy (NS3145) or multicopy (DH5Iq). These results
indicate that only a small proportion of the
pAd10-SacBII DNA contains an inactive sacB gene or
nonfunctional synthetic promoter. This "background"
population is most likely due to single base mutations
in the sacB structural gene and promoter region which
change the expression of levansucrase either
quantitatively or qualitatively, thus allowing growth on
media containing sucrose. The results from part (B)
indicate that molecular biological manipulations
(restriction digestion and ligation reactions and
phenol/chloroform extractions) do have an impact on the
efficiency of the SacB positive selection system. The
transformation efficiency was decreased even for the
uncut DNA, although less DNA (100 ng as opposed to 200
ng) was used for each strain transformed. The positive
selection efficiency was approximately 200 to >400 fold.
This number dropped somewhat when the pAd10-SacBII was
cut at the unique BamHI and SalI sites between the sacB
structural gene and the synthetic promoter. Since a
ligation step is necessary to achieve a functional
pAd10-SacBII molecule, it is possible that plasmid
dimers can form which inactivate the sacB gene by a head
to head or tail to tail arrangement. These plasmid
dimers would still be kan-r but would no longer show
sucrose sensitivity. When the SacBII vector was cut to
generate "arms" via the ScaI/BamHI or ScaI/SalI double
digest, transformation efficiencies were markedly
decreased. This is not surprising since ScaI creates a
blunt end which is recalcitrant to ligation.
Interestingly, the positive selection efficiency was very
poor (2 to 5 fold). This may be due to the fact that
under these conditions most of the kan-r colonies are
from plasmids generated via aberrant ligation products
that inactivate the sacB gene, hence giving a sucrose
resistant phenotype.
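
The selection efficiency defined above (kan-r colonies divided by kan/sucrose-r colonies) can be recomputed from the colony counts in Table 1. The following Python sketch is illustrative only and is not part of the original disclosure. The function names are hypothetical, the estimated counts of ~10,000 are entered as exactly 10000, and a plate with zero sucrose-resistant colonies is reported as a lower bound rather than a ratio.

```python
from typing import Optional

# Colony counts transcribed from Table 1 as (kan-r, kan/sucrose-r).
# Part A counts of ~10,000 are estimates in the source; they are entered
# here as 10000, so part A ratios are order-of-magnitude values only.
TABLE_1 = {
    "A: uncut #1":   {"NS3145": (10000, 6), "DH5Iq": (10000, 2)},
    "A: uncut #2":   {"NS3145": (10000, 7), "DH5Iq": (10000, 7)},
    "B: uncut":      {"NS3145": (361, 2),   "DH5Iq": (405, 0)},
    "B: BamHI":      {"NS3145": (135, 6),   "DH5Iq": (90, 5)},
    "B: SalI":       {"NS3145": (246, 5),   "DH5Iq": (222, 3)},
    "B: ScaI/BamHI": {"NS3145": (56, 7),    "DH5Iq": (9, 13)},
    "B: ScaI/SalI":  {"NS3145": (107, 7),   "DH5Iq": (34, 9)},
}


def selection_efficiency(kan_r: int, sucrose_r: int) -> Optional[float]:
    """Return kan-r / sucrose-r, or None if no sucrose-r colonies grew."""
    if sucrose_r == 0:
        return None  # ratio undefined; only a lower bound can be stated
    return kan_r / sucrose_r


def format_fold(kan_r: int, sucrose_r: int) -> str:
    """Format the efficiency as text, using '>N' for the zero-colony case."""
    eff = selection_efficiency(kan_r, sucrose_r)
    return f">{kan_r}" if eff is None else f"{eff:.1f}"


if __name__ == "__main__":
    for dna, strains in TABLE_1.items():
        for strain, (kan, suc) in strains.items():
            print(f"{dna:15s} {strain:7s} {format_fold(kan, suc):>8s}-fold")
```

For the uncut DNA of part B this gives 180.5 fold in NS3145 and a lower bound of >405 fold in DH5Iq, in line with the "approximately 200 to >400 fold" figure stated in the text.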

EXAMPLE 2 

DNA Packaging and Positive Selection
Efficiency of pAd10-SacBII

To determine how well the positive selection system
of the SacBII vector would perform during the
construction of a genomic library, the following model
experiment was done. Four micrograms of pAd10-SacBII
DNA was cut with the appropriate restriction enzyme (as
described above) at 37°C for 3 hours. DNA was extracted
with phenol/chloroform and then ethanol precipitated.
Some samples were then treated with calf intestinal
phosphatase (CIP) (see Table 2, part (A) below) by
resuspending the DNA in 50 microliters of 50 mM Tris (pH
8.0), 0.1 mM EDTA. Calf intestinal phosphatase was
added to a final concentration of 0.01 units and the
reaction incubated at 37°C for 1 hour. The CIP reaction
was extracted with phenol/chloroform, ethanol
precipitated and resuspended in 20 microliters of TE
buffer. The cut SacBII vector DNA was then placed in a
ligation reaction in the absence of foreign DNA, under
the following conditions: ~3 ug vector DNA (18 ul), 3
ul 10X ligase buffer, 2 ul 25 mM ATP, 1.5 ul T4 DNA
ligase, 5 ul TE buffer. The DNA was first heated at
70°C for 2 minutes and then the rest of the ligation
reaction added and incubated at 16°C overnight. The
ligation reaction was then heated at 70°C for 3 minutes
and added to the two stage P1 in vitro packaging
reaction as described previously.
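
As a consistency check on the vector-only ligation recipe above, the component volumes can be summed and the final buffer and ATP concentrations derived with the standard dilution formula. The Python sketch below is illustrative and not part of the original disclosure. The helper names are hypothetical, and the vector mass is the approximate ~3 ug given in the text.

```python
# Components of the vector-only ligation reaction, in microliters,
# as listed in the text above.
LIGATION_MIX_UL = {
    "vector DNA": 18.0,
    "10X ligase buffer": 3.0,
    "25 mM ATP": 2.0,
    "T4 DNA ligase": 1.5,
    "TE buffer": 5.0,
}


def total_volume(mix: dict) -> float:
    """Sum the component volumes (microliters)."""
    return sum(mix.values())


def final_concentration(stock: float, volume_ul: float, total_ul: float) -> float:
    """Dilution formula: C_final = C_stock * V_added / V_total."""
    return stock * volume_ul / total_ul


if __name__ == "__main__":
    total = total_volume(LIGATION_MIX_UL)
    buffer_x = final_concentration(10.0, LIGATION_MIX_UL["10X ligase buffer"], total)
    atp_mm = final_concentration(25.0, LIGATION_MIX_UL["25 mM ATP"], total)
    dna_ug_per_ul = 3.0 / total  # vector mass is approximate (~3 ug) in the source
    print(f"total volume: {total:.1f} ul")
    print(f"ligase buffer: {buffer_x:.2f}X")
    print(f"ATP: {atp_mm:.2f} mM")
    print(f"vector DNA: ~{dna_ug_per_ul:.2f} ug/ul")
```

The listed volumes sum to 29.5 ul, so the 10X buffer ends up close to 1X and the ATP at roughly 1.7 mM.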
In part (B) of Table 2, partially Sau3A digested
human DNA was used in the ligation reaction with the
pAd10-SacBII vector. Two hundred nanograms of vector
DNA was digested with ScaI and BamHI and added to Sau3A
partially digested human genomic DNA (as described
previously) and incubated at 37°C for 30 minutes.
Ligation reactions were set up as follows: 1.5 ul
(200 ng) vector DNA, 15 ul (~1 ug) human genomic DNA,
2 ul 10X ligase buffer, 1 ul 25 mM ATP, 1 ul T4 DNA
ligase and incubated overnight at 16°C. Tris-EDTA
buffer was substituted for the genomic DNA in the vector
control reaction. Three different time fractions (6, 8,
and 10 minutes) of the human genomic Sau3A partial
digest were used. The ligation reactions were heated
for 3 minutes at 70°C and added to the P1 in vitro
packaging reaction as described previously. The number
of kanamycin and kanamycin/sucrose resistant colonies
was recorded and is presented in Table 2.

TABLE 2

DNA Packaging and Positive Selection
Efficiency of pAd10-SacBII

                                         NS3529 (cre+)
Vector DNA            Insert DNA       Kan-r    Sucrose-r

A)
uncut                 none             ~2000        1
BamHI                 none             ~1500      200
ScaI                  none             ~2000        0
BamHI (CIP)           none              ~250        5
ScaI (CIP)/BamHI      none             ~1500       42
ScaI/BamHI (CIP)      none              ~350       28

B)
ScaI/BamHI            none               800        8
ScaI/BamHI            Hu-6"              686      516
ScaI/BamHI            Hu-8"              518      328
ScaI/BamHI            Hu-10"             574      418

These results show that the SacBII vector
significantly decreases the number of kan-r colonies
when plated on media containing 5% sucrose. In part (A)
of Table 2 the SacBII vector was cut with restriction
enzymes BamHI or ScaI and then religated.
Interestingly, only the BamHI cut increased the number
of sucrose resistant colonies while the ScaI cut, which
is distant from the sacB structural gene, did not. These
data are in accord with the data from Table 1 which
showed an increase in the number of sucrose resistant
colonies when the region between the sacB gene and its
promoter was manipulated via restriction digest and
ligation. Phosphatase treatment with CIP did decrease
the total number of kanamycin resistant colonies when
the BamHI site was treated. Also, CIP treatment
improved the ratio of sucrose-r colonies to kan-r
colonies.

WO 92/14819 PCT/US92/01074

Part (B) of Table 2 reflects the experimental
conditions in which genomic libraries can be generated
using the P1 cloning system. As stated previously, one
problem with the old P1 cloning vector (pAd10) was the
number of kan-r colonies generated with no insert. As
can be seen from the control ScaI/BamHI with no insert
DNA, the number of kan-r colonies is decreased 100 fold
when plated on media containing sucrose. This result
demonstrates that the SacBII vector greatly reduces the
contribution of potential "no-insert" containing clones
during a genomic cloning experiment. This result is
further supported by the data generated when genomic DNA
is added to cut SacBII vector during the ligation
reactions. The number of sucrose-r colonies is
increased over 60 fold in the Hu-6" reaction. Also, the
ratio of sucrose-r colonies to kan-r colonies is greatly
increased (516/686 in the Hu-6" reaction versus 8/800 in
the no-insert control), indicating that most of
the SacBII vector has productively ligated to genomic
DNA fragments. It is that fraction of "insert-less"
clones which grow on the kan-r plate but not on the
sucrose-r plate that the SacBII positive cloning system
was designed to select against. The following example
demonstrates that the sucrose-r clones do contain
genomic DNA while some of the kan-r clones do not.
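
The fold changes quoted for part (B) of Table 2 follow from simple ratios of the colony counts. The Python sketch below recomputes them. It is illustrative only, the function names are hypothetical, and the counts are those transcribed from part (B) of Table 2.

```python
# Part (B) of Table 2: ScaI/BamHI-cut vector, (kan-r, sucrose-r) colonies
# keyed by the insert DNA fraction used in the ligation.
PART_B = {
    "none":   (800, 8),
    'Hu-6"':  (686, 516),
    'Hu-8"':  (518, 328),
    'Hu-10"': (574, 418),
}


def background_reduction(control: tuple) -> float:
    """Fold drop in colonies for the no-insert control: kan-r / sucrose-r."""
    kan_r, sucrose_r = control
    return kan_r / sucrose_r


def sucrose_fold_increase(sample: tuple, control: tuple) -> float:
    """Fold rise in sucrose-r colonies relative to the no-insert control."""
    return sample[1] / control[1]


def sucrose_fraction(kan_r: int, sucrose_r: int) -> float:
    """Fraction of kan-r colonies that are also sucrose resistant."""
    return sucrose_r / kan_r


if __name__ == "__main__":
    control = PART_B["none"]
    print(f"no-insert background reduced {background_reduction(control):.0f} fold")
    for insert, counts in PART_B.items():
        if insert == "none":
            continue
        print(f"{insert}: sucrose-r up {sucrose_fold_increase(counts, control):.1f} fold, "
              f"sucrose-r/kan-r = {sucrose_fraction(*counts):.2f}")
```

With these counts the no-insert control drops 100 fold on sucrose and the Hu-6" reaction yields 64.5 times as many sucrose-r colonies as the control, matching the figures in the text.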

EXAMPLE 3

Analysis and Characterization of SacBII
High Molecular Weight Human Clones

To demonstrate that the pAd10-SacBII sucrose
resistant colonies actually contain genomic DNA inserts,
DNA was prepared from a number of colonies from both the
kan-r and sucrose-r populations generated in the
packaging experiment described in Example 2. These
clones were analyzed via restriction mapping with
BglII/XhoI and fractionation by agarose gel
electrophoresis. The SacBII-human DNA's were also
characterized by NotI restriction digest and
fractionation via pulse field inversion gel
electrophoresis.

DNA was prepared (as described above) from 64
colonies from the kanamycin/5% sucrose plates and 12
colonies from the kanamycin plates from the Hu-6" and
Hu-8" reactions described in Example 2. Thirteen
microliters of DNA was digested with BglII and XhoI at
37°C for 1 hour. The reaction was stopped by heating at
70°C for 5 minutes in the presence of stop dye buffer
(6X = 0.25% bromophenol blue, 0.25% xylene cyanol, 40%
(w/v) sucrose in water) and the DNA fragments were then
fractionated on a 1% agarose gel for 12 hours at 20
volts in 1X TBE buffer. The gels were stained with
ethidium bromide and DNA visualized by UV fluorescence.
Results show that almost all (>95%) of the sucrose-r
clones contain a genomic DNA insert but that many of the
kan-r clones (~66%) do not contain DNA inserts. This
difference is due to the positive selection aspect of
the pAd10-SacBII vector.

A photograph of a gel that is a composite of a
representative portion of the sampled colonies is
presented in Figure 3. Lane 1 contains molecular weight
markers, lanes 2-11 contain DNA from colonies derived
from the kanamycin/sucrose agar plates, and lanes 12-15
contain DNA from colonies from the kanamycin plates.
The clones in lanes 2-4 are SacBII vectors that
contain relatively small genomic inserts. Lanes 5-11
are SacBII clones that contain large molecular weight
human genomic inserts. Lanes 12-15 contain DNA with no
genomic DNA inserts and reflect the DNA fragments
generated from the pAd10-SacBII vector after it has gone
through cre mediated lox-lox recombination. Background
fragments are due to the presence of the F' plasmid that
contains the lacIq gene recombinase in E. coli strain
NS3145.

To further characterize the SacBII P1 clones, DNA
from a portion of the clones presented in Figure 3 was
subjected to a NotI restriction digest and then
fractionated by field inversion gel electrophoresis on
a 1% agarose gel (methodology as described above). A
photograph of this gel is presented in Figure 4. Lanes
1 and 15-18 are molecular weight markers, lanes 2-5 are
clones from kanamycin plates (no sucrose), and lanes 6-14
are clones from the kanamycin/sucrose agar plates. The
clones from the no sucrose plates contain no genomic DNA
inserts and the size of the SacBII fragment after
lox-lox recombination is about 18 kb. The clones from
the plus sucrose plates contain DNA inserts of two
general sizes. The large molecular weight clones range
in size from 75 to 100 kb. The low molecular weight
clones are in the 30 to 50 kb range. DNA's in lanes 2-8
are from strain NS3145 and those in lanes 9-14 are from
strain NS3529. The smaller NotI fragment in lane 8 is
from the F'lacIq plasmid in strain NS3145 and the larger
fragment is from the SacBII-human plasmid. These
results demonstrate the power of rare restriction site
mapping (in this case NotI) in the analysis of P1 clones
using the new pAd10-SacBII vector.
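
The size classes described above (about 18 kb for the recombined vector alone, 30 to 50 kb for low molecular weight clones, 75 to 100 kb for large molecular weight clones) lend themselves to a simple lookup when scoring NotI fragments read from a gel. The Python sketch below is illustrative only. The function name, the 2 kb tolerance around the vector size, and the example fragment sizes are assumptions introduced here and do not come from the disclosure.

```python
VECTOR_ONLY_KB = 18.0            # SacBII fragment after lox-lox recombination
LOW_MW_RANGE_KB = (30.0, 50.0)   # low molecular weight clones
HIGH_MW_RANGE_KB = (75.0, 100.0) # large molecular weight clones
VECTOR_TOLERANCE_KB = 2.0        # assumed gel sizing tolerance (not from the source)


def classify_clone(size_kb: float) -> str:
    """Assign a NotI fragment size (kb) to one of the classes in the text."""
    if abs(size_kb - VECTOR_ONLY_KB) <= VECTOR_TOLERANCE_KB:
        return "no insert"
    if LOW_MW_RANGE_KB[0] <= size_kb <= LOW_MW_RANGE_KB[1]:
        return "low molecular weight"
    if HIGH_MW_RANGE_KB[0] <= size_kb <= HIGH_MW_RANGE_KB[1]:
        return "high molecular weight"
    return "unclassified"


if __name__ == "__main__":
    # Hypothetical fragment sizes, chosen only to exercise each class.
    for size in (18.0, 42.0, 88.0, 60.0):
        print(f"{size:6.1f} kb -> {classify_clone(size)}")
```

Sizes that fall between the stated ranges are reported as unclassified rather than forced into a class, since the text gives no guidance for them.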

From the foregoing description, one skilled in the
art can easily ascertain the essential characteristics
of this invention and without departing from the spirit
and scope thereof, can make various modifications and
changes of the invention to adapt to various uses and
conditions.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Nat L. Sternberg, Ph.D.
               James C. Pierce, Ph.D.
(ii) TITLE OF INVENTION: A Positive Selection Vector
                         for the Bacteriophage P1
                         Cloning System
(iii) NUMBER OF SEQUENCES: 8
(iv) CORRESPONDENCE ADDRESS:
    (A) ADDRESSEE: E. I. du Pont de Nemours and
                   Company
    (B) STREET: 1007 Market Street
    (C) CITY: Wilmington
    (D) STATE: Delaware
    (E) COUNTRY: U.S.A.
    (F) ZIP: 19898
(v) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Diskette, 3.50 inch, 1.0MB
    (B) COMPUTER: Macintosh
    (C) OPERATING SYSTEM: Macintosh System Software,
                          Version 6.0
    (D) SOFTWARE: Microsoft Word, Version 4.0
(vi) CURRENT APPLICATION DATA:
    (A) APPLICATION NUMBER: UNKNOWN
    (B) FILING DATE: UNKNOWN
    (C) CLASSIFICATION: UNKNOWN
(vii) ATTORNEY/AGENT INFORMATION:
    (A) NAME: R. Thomas Gallegos
    (B) REGISTRATION NUMBER: 32,692
    (C) REFERENCE/DOCKET NUMBER: CR-8930
(viii) TELECOMMUNICATION INFORMATION:
    (A) TELEPHONE: (302) 892-7342
    (B) TELEFAX: (302) 892-7949
    (C) TELEX: 835420

(2) INFORMATION FOR SEQ ID NO:1:

(i) Sequence Characteristics:
    (A) LENGTH: 52 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:1:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TCGAGCTTGA CATTGTAGGA CTATATTGCT CTAATAAATT TGCGGCCGCT TG

(2) INFORMATION FOR SEQ ID NO:2:

(i) Sequence Characteristics:
    (A) LENGTH: 52 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:2:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GATCCAAGCG GCCGCAAATT TATTAGAGCA ATATAGTCCT ACAATGTCAA GC

(2) INFORMATION FOR SEQ ID NO:3:

(i) Sequence Characteristics:
    (A) LENGTH: 37 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:3:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GGCCGCGGAT CCGTCGACGG CCAATTAGGC CTACGTA

(2) INFORMATION FOR SEQ ID NO:4:

(i) Sequence Characteristics:
    (A) LENGTH: 37 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:4:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GATCTACGTA GGCCTAATTG GCCGTCGACG GATCCGC

(2) INFORMATION FOR SEQ ID NO:5:

(i) Sequence Characteristics:
    (A) LENGTH: 9 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:5:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

CCACTAGTC

(2) INFORMATION FOR SEQ ID NO:6:

(i) Sequence Characteristics:
    (A) LENGTH: 13 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:6:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

AATTGACTAG TGG

(2) INFORMATION FOR SEQ ID NO:7:

(i) Sequence Characteristics:
    (A) LENGTH: 56 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:7:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

GGCCGCTAAT ACGACTCACT ATAGGGAGAG GATCCTTCTA TAGTGTCACC TAAATG

(2) INFORMATION FOR SEQ ID NO:8:

(i) Sequence Characteristics:
    (A) LENGTH: 56 bases
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: synthetic oligonucleotide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) ORIGINAL SOURCE:
    (A) ORGANISM: not applicable
    (B) STRAIN: not applicable
    (C) CELL TYPE: not applicable
(vi) IMMEDIATE SOURCE:
    (A) LIBRARY: not applicable
    (B) CLONE: not applicable
(vii) FEATURES, SEQUENCE IDENTIFICATION SEQ ID NO:8:
(viii) PUBLICATION INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

TCGACATTTA GGTGACACTA TAGAAGGATC CTCTCCCTAT AGTGAGTCGT ATTAGC
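
The eight oligonucleotides in the listing appear to form four complementary pairs (SEQ ID NO:1/2, 3/4, 5/6 and 7/8), each annealing into a duplex with short 5' single-stranded ends. The Python sketch below checks this by comparing one strand with the reverse complement of the other. It is illustrative only and is not part of the original disclosure. The function names and the minimum duplex length are assumptions, and the sequences for SEQ ID NO:2 and NO:7 are entered with the unreadable characters of the scanned listing resolved to the bases that complement their partner strand.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Sequences as given in the listing (spaces are ignored when cleaned).
SEQ = {
    1: "TCGAGCTTGA CATTGTAGGA CTATATTGCT CTAATAAATT TGCGGCCGCT TG",
    2: "GATCCAAGCG GCCGCAAATT TATTAGAGCA ATATAGTCCT ACAATGTCAA GC",
    3: "GGCCGCGGAT CCGTCGACGG CCAATTAGGC CTACGTA",
    4: "GATCTACGTA GGCCTAATTG GCCGTCGACG GATCCGC",
    5: "CCACTAGTC",
    6: "AATTGACTAG TGG",
    7: "GGCCGCTAAT ACGACTCACT ATAGGGAGAG GATCCTTCTA TAGTGTCACC TAAATG",
    8: "TCGACATTTA GGTGACACTA TAGAAGGATC CTCTCCCTAT AGTGAGTCGT ATTAGC",
}


def clean(seq: str) -> str:
    """Remove the spacing used in the listing and normalise case."""
    return seq.replace(" ", "").upper()


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def anneal(top: str, bottom: str, min_duplex: int = 6):
    """Return (top 5' overhang, duplex length, bottom 5' overhang).

    Finds the longest 3' portion of `top` that pairs with the 3' portion
    of `bottom`; returns None if no duplex of at least `min_duplex` bases
    exists.
    """
    top, bottom = clean(top), clean(bottom)
    rc_bottom = revcomp(bottom)
    for k in range(len(top) - min_duplex + 1):
        duplex = top[k:]
        if rc_bottom.startswith(duplex):
            return top[:k], len(duplex), bottom[:len(bottom) - len(duplex)]
    return None


if __name__ == "__main__":
    for a, b in ((1, 2), (3, 4), (5, 6), (7, 8)):
        print(f"SEQ ID NO:{a}/{b}: {anneal(SEQ[a], SEQ[b])}")
```

Each pair is reported as a fully complementary core flanked by 5' extensions of at most four bases, which is the pattern expected for synthetic linkers designed to ligate into restriction-cut vector ends.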
What is claimed is:

1. A positive selection cassette, comprising:
    (a) a gene, the expression of which is lethal
        to a host cell when the host cell is grown
        under nonrepressed conditions;
    (b) a synthetic consensus Escherichia coli
        promoter upstream from the lethal gene;
    (c) a P1 c1 repressor sequence overlapping the
        promoter; and
    (d) a cloning site between the lethal gene and
        the promoter, the cloning site bordered by
        an Sp6 and a T7 RNA promoter, wherein
        cloning a nucleic acid into the cloning
        site prevents expression of the lethal
        gene.

2. A positive selection cassette, comprising:
    (a) a SacB gene from Bacillus
    (b) a synthetic consensus Escherichia coli
        promoter upstream from the SacB gene;
    (c) a P1 c1 repressor sequence overlapping the
        promoter; and
    (d) a cloning cassette comprising Sp6 and T7
        promoters, bordered by BamHI restriction
        site, the Sp6 and T7 promoters, SfiI rare
        restriction site and a NotI restriction
        site, the cloning site located between the
        synthetic consensus promoter and the SacB
        gene, wherein cloning of a nucleic acid at
        the cloning site prevents expression of
        the SacB gene allowing growth of the host
        organism in a medium containing sucrose.

3. A vector comprising the positive selection
cassette of Claim 1.

4. A positive selection cassette, comprising:
    (a) a gene, the expression of which is lethal
        to a host cell when the host cell is grown
        under nonrepressed conditions;
    (b) a synthetic consensus Escherichia coli
        promoter upstream from the lethal gene;
    (c) a P1 c1 repressor sequence overlapping the
        promoter; and
    (d) a cloning site between the lethal gene and
        the promoter, wherein cloning a nucleic
        acid into the cloning site prevents
        expression of the lethal gene.

5. A vector comprising the positive selection
cassette of Claim 4.

6. A host cell genetically engineered to comprise
a P1 c1 repressor sequence.

7. A host cell comprising a P1 c1 repressor
sequence and a positive selection cassette.

8. The host cell of Claim 7 further comprising a
vector which comprises the P1 c1 repressor sequence.

9. The host cell of Claim 7 further comprising a
vector which comprises the positive selection cassette.

10. The host cell of Claim 7 wherein the positive
selection cassette is the SacBI cassette.

11. The host cell of Claim 7 wherein the positive
selection cassette is the SacBII cassette.

12. A method of positive selection of nucleic acid
clones, comprising:
    (a) cloning a large molecular weight DNA
        fragment into the vector of Claim 3 at the
        cloning site;
    (b) inserting the vector into an appropriate
        host cell; and
    (c) growing the host cell under nonrepressed
        conditions.

13. A method of positive selection of nucleic acid
clones, comprising:
    (a) cloning a large molecular weight DNA
        fragment into the vector of Claim 5 at the
        cloning site;
    (b) inserting the vector into an appropriate
        host cell; and
    (c) growing the host cell under nonrepressed
        conditions.